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Abstract 

Network MIMO is considered to be a key solution for the next generation wireless systems in breaking the 
interference bottleneck in cellular systems. In the MIMO systems, open-loop transmission scheme is used to support 
mobile stations (MSs) with high mobilities because the base stations (BSs) do not need to track the fast varying 
channel fading. In this paper, we consider an open-loop network MIMO system with K BSs serving K private MSs 
and M c common MS based on a novel partial cooperation overlaying scheme. Exploiting the heterogeneous path 
gains between the private MSs and the common MSs, each of the K BSs serves a private MS non-cooperatively 
and the K BSs also serve the M c common MSs cooperatively. The proposed scheme does not require closed loop 
instantaneous channel state information feedback, which is highly desirable for high mobility users. Furthermore, 
we formulate the long-term distributive power allocation problem between the private MSs and the common MSs 
at each of the K BSs using a partial cooperative game. We show that the long-term power allocation game has a 
unique Nash Equilibrium (NE) but standard best response update may not always converge to the NE. As a result, 
we propose a low-complexity distributive long-term power allocation algorithm which only relies on the local long- 
term channel statistics and has provable convergence property. Through numerical simulations, we show that the 
proposed open-loop SDMA scheme with long-term distributive power allocation can achieve significant performance 
advantages over the other reference baseline schemes. 

Index Terms 

Network MIMO, Cooperative BS, Open-loop Transmission, Resources Allocation 

I. Introduction 



Inter-cell interference (ICI) has been widely considered as a critical performance bottleneck for wireless commu- 
O nications in the cellular networks 12, Q. For instance, mobile stations (MSs) at the cell edge (within the coverage 
of multiple base stations (BSs)) are usually interference limited. To alleviate the interference issues, traditional 
cellular systems employ frequency reuse so as to control the interference at the expense of poor spectral efficiency 
0. On the other hand, network MIMO communications H, are considered to be a key solution for the next 
generation wireless systems in breaking the interference bottleneck in cellular systems. The idea of network MIMO 
communications is to utilize cooperation among multiple BSs for joint signal processing in the uplink/downlink 
directions. Through cooperation, the undesired ICI can be transformed into useful signals via the collaborative 
transmission among adjacent BSs 0, Q. 

One key challenge in network MIMO systems is on how to spatially multiplex multiple MSs effectively and 
efficiently. Traditionally, linear precoding (such as Tx-MMSE flHJ or zero-forcing 0) could be used to spatially 
multiplex MSs but closed loop knowledge of instantaneous channel state information (CSI) is required at the BS and 
this refers to closed-loop spatial division multiplexing access (SDMA) ifTOl . ifTTl . However, the closed-loop SDMA 
schemes only work for low mobility MSs where the channel conditions remain quasi-static within the transmission 
duration. For the high mobility MSs, it is very difficult to keep track of the channel state information (CSI) at the 
BSs and hence, the above closed-loop schemes cannot be applied for high mobility MSs. 
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Open-loop transmission scheme has been widely considered in the existing literature for high mobility users. By 
open-loop schemes, we mean that instantaneous CSI knowledge is not required at the BS. For example, in lfT2l . 
lfT3ll . the authors proposed an open-loop transmission scheme, namely the space-time block code (STBC) for point- 
to-point scenarios. In |[14l . double-STTD which is able to serve two users simultaneously has been proposed for 
4 transmit antenna and 2 receive antenna MIMO link to fully exploit the spatial diversity and spatial multiplexing 
gains. All the above open-loop schemes only work in a single cell scenario and there are several important technical 
challenges to extend to multi-cell systems. They are elaborated below. 

• Heterogeneous Path Gain and Shadowing Effect In the network MIMO systems, the path gains of different 
MSs are quite different. The heterogeneous path gain effect for different types of MSs leads to significant 
power efficiency loss and hence the conventional open-loop transmission schemes (which have ignored the 
heterogeneous path gain and shadowing effects) cannot be directly applied in the network MIMO. 

• Dynamic and Heterogeneous MIMO Configurations In the network MIMO systems, the number of co- 
operating BSs is changing dynamically and hence, we are not able to use existing STBC structures in such 
dynamic MIMO configurations (with time varying number of transmit antennas). 

In this paper, we consider a network MIMO system with multiple BSs and multiple high mobility MSs3. We 
propose a novel open-loop scheme to serve K private MSs and M c common MSs simultaneously based on novel 
partial cooperative overlaying. Specifically, each BS serves a private MS non-cooperatively. By exploiting the 
path gain difference between the common MSs and the private MS, the K BSs also serve the M c common MSs 
cooperatively at the same spectrum as the private MSs. The proposed scheme does not require knowledge of 
instantaneous CSI at the BS and supports dynamic and flexible network MIMO configurations. Furthermore, to 
adjust the long-term power allocation between the common MSs and the private MS at each of the K BSs, we 
formulate the long-term distributive power control problem using a partial cooperative game formulation. We show 
that the long-term power control game has a unique Nash Equilibrium (NE) but the conventional best response 
update algorithm cannot always converge to the NE. As a result, we propose a low-complexity distributive long-term 
power allocation algorithm which only relies on the local channel statistics and has provable convergence property. 
Through numerical simulations, we show that the proposed open-loop overlaying scheme with a distributive long- 
term power allocation algorithm can achieve significant performance advantages over the traditional schemes and 
the distributive algorithm has negligible performance loss compared with the centralized power allocation scheme. 

A. Notations 

We adopt the following notation conventions. Boldface upper case letters denote matrices, boldface lower case 
letters denote column vectors, and lightface italics denote scalers. C nxm denotes the set of n x m matrices with 
complex- valued entries and the superscript (.) H denotes Hermitian transpose operation. The matrix I n denotes the 
n x n identity matrix. Expressions P r (x) denotes the probability density function (p.d.f.) of the random variable 
x. The expectation with respect to x is written as E x [-] or simply as E[-]. 

II. System Model 

Consider a cellular network where there are K base stations (BSs) and M high mobility mobile stations (MSs) 
as shown in Figure [TJ We assume each BS is equipped with N t transmit antennas^ and each MS is equipped with 
N r receive antennas. 

Denote T to be the transmission time intervals^ and G C NtXT tQ ^ e ^ transmitted signals from the k-th BS. 
The received signals of m-th MS, denoted by Y m £ C NrXT , can thus be modeled as follows, 

K 

Y m = VPkLmkHmkSk + Z m , Vm = 1, 2, . . . , M (1) 
fc=i 

'As a result, the BS does not have knowledge of instantaneous channel state information (CSI) of the MSs. 

2 In this paper, we focus on the case where all the BSs have the same number of antennas. As we elaborate later, the proposed scheme 
can be directly applied to the dynamic and heterogeneous MIMO configurations with little modification. 

'Transmission time interval is defined to be the time duration where the channel fading coefficients in the multi-cell network MIMO 
systems remain quasi-static. 
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where H m fc G C rX ' is the normalized complex fading coefficients from the k-th BS to the m-th MS, Z. m G C ' 
is the additive white complex Gaussian noise (AWGN) with zero mean and unit variances, denotes the transmit 
power of the /c-th BS and L m k denotes the long-term path gain and shadowing from the £>th BS to the m-th MS. 

The following assumptions are made through the rest of the paper. Firstly, all the receivers in the system have 
perfect CSI of each corresponding link, i.e. the Z-th MS has the perfect CSI knowledge from the /c-th BS. Secondly, 
we assume all the BSs have no instantaneous CSI knowledge {H m k, m = 1, 2, . . . , M, k = 1,2, ... , K}. Thirdly, 
all the entries of the channel coefficient matrix {H m k, m = 1,2,..., M , k = 1,2,..., K} are independent and 
identical distributed (i.i.d.) complex Gaussian random variables with zero mean and unit variance. Moreover, we 
consider block fading channels where the aggregate CSI H = {H m k, m = 1,2,..., M, = 1,2,..., K} remains 
quasi-static within a fading block (i.e. the transmission time interval T) but varies between different fading blocks. 



In this section we shall first introduce a user scheduling algorithm, which classifies the high mobility MSs into 
K private MS sets (one for each BS) and a common MS set (shared by all the BS). Based on the user scheduling 
algorithm, we propose a novel open loop scheme to overlay the MSs in the common set and the private sets 
simultaneously using partial cooperation. We shall then discuss the problem formulation of the long-term power 
allocation control in what follows. 

A. Long-term User Scheduling Algorithm 

We first define the private MS set and the common MS set below. 
Definition 1 (Common/Private MS Sets): 

• k-th Private MS Set: The fc-th private MS set U v k consists of one MS (the m-th MS) in which the long-term 
path gain and shadowing configuration {L rn i, L m 2, . . . , L m K] satisfies the following criteria: L m k — L m j > 
£?,Vj 7^ k, where £j? is the k-th private MS set threshold. 

• Common MS Set: The common MS set U c consists of at most M c MSs such that \L m k — Yl,f=i L m j\ — 
£ c , Vfc = 1,2, ... ,K for all m G U c , where £ c is the common MS set threshold. 

Remark L To ensure U c f}U% = for all k = 1, . . . ,K, the thresholds need to satisfy ^ > for all 

k = 1, . . . ,K. As such, the private MS sets consist of MSs closer to the home cell whereas the common MS set 
consists of the MSs closer to the "coverage overlap areas" between the BSs. 

Remark 2: On the other hand, a MS may belong to neither of the above two set. In that case, the MS is not 
selected to be the "common MS" or the "private MS" and does not participate in the "open loop overlaying scheme". 
This MS may be served in the normal way (e.g. assigned another sub-band). Since the MS are moving around, this 
particular MS may be able to be selected as the "common MS" or "private MS" in some future time. 

Algorithm 1 illustrates a low complexity user scheduling algorithm to construct U?, k = 1, . . . , K and U c based 
on the local long-term path gain and shadowing at each of the MSs. 

B. Signal Model for the Private/Common MS 

Given the user sets Ul, k = 1, . . . , K and U c , the received signal of the m-th MS in the private and common 
MS sets, denoted by Y m , is given by: 



Remark 3: At the private MS, inter-cell interference doesn't appear in the received signal model because the 
inter-cell interference at the private MS is very weak and negligible. For instance, if = 20dB, k = 1, 2, . . . , K, 
the inter-cell interference would be 100 times less than the useful signal. 



III. Problem Formulation 




m G U c 



(2) 
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Algorithm 1 Long-term User Scheduling Algorithm 
. Step 1: MS Broadcast: 

At the m-th (m = 1,2,..., M) MS side, the m-th MS measures path gains L mk , k = 1, . . . , K(in dB) from all 
the K BSs. According to Definition!]] if there exists L m k such that L m k — L m j > Vj ^ k, then the m-th MS 
labels itself as a potential member of the kth private MS set; if VA; = 1, 2, . . . , K, \L m j — ^2f=i L m j\ < £ c , 
then the m-th MS labels itself as a potential member of the common MS set. Those MS being potential 
members of the private set or the common set will then broadcasts its (label,BS_ID) to all the K BSs. 

. Step 2: Formation of the Private MS Set: 

Denote U k ,k = 1, 2, . . . , K to be the k-th private MS set. The k-th BS picks one MS with label = "private" 
and BS_ID = k to be the member of the private MS set W?, k = 1, 2, . . . , K randomly. 

• Step 3: Formation of the common MS Set: 

Denote U k to be the potential common MS set at the k-th BS. Assign the m-th MS to U k at the k-th BS if 
the label from the m-th MS is "COMMON". Each BS then submits Ui to the base station controller (BSC). 
At the BSC, the common MS set U c is chosen as an intersection of U k , k = 1, 2, . . . , K, i.e. f] k=l U k . If the 
number of members in the intersection exceeds M°, then M c users will be selected randomly. 



C. Open-Loop Overlaying Transmission/Dection Scheme 

1) Open-Loop Overlaying Transmission Scheme: Consider the information streams for the M c MSs in the 
common MS set (denoted by Xy, j G U c ) and the information streams for the l-th MS (in the k-th private MS 
set) (denoted by X;,Z G U k ) are transmitted over the N t antennas at the k-th BS. To exploit the possible diversity 
provided by the transmit antenna arrays, orthogonal space-time block code (OSTBC) lfl~2ll . lfl5l scheme is applied 
for transmission, which spans over the entire transmitting antennas. The information streams (X;,/ G U c ) for the 
M c common are jointly encoded as the OSTBC S c and the information streams for the MS in the k-th private 
MS set is encoded as the OSTBC as shown in Fig. |2 Without loss of generality, we assume at the k-th BS, 
the two OSTBCs and S c are delivered through Nf and Nf transmit antennas respectively with Nf + N£ = N t . 
So the transmitted symbols at the fc-th BS are given by S k = [(S^) T (S C ) T ] T . 

For illustration purpose, let us consider a specific case with M c = 2. Assume N t = 4 with Nf = N£ = 2. At 
each BS, the two information streams for two MSs in the common MS set respectively are jointly OSTBC encoded 
into one Alamouti's structure lfT51 and the information streams for the k-th private MS set are encoded as the other 
Alamouti structure. The whole transmit structure is also known as double space-time transmit diversity (D-STTD) 
lfT4l . The transmitted structure at the /c-th BS is given by: 
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(3) 



, 9 p k is the power allocation ratio for the private MS set in 



the coverage of the k-th BS and Q c k is the power allocation ratio for the common MS set at the k-th BS. 9 k and 6 k 
satisfy the relation Q\ + 9i = 1. 

The proposed open-loop oveylaying scheme has the following advantages. 

• Exploiting the Heterogeneous Path Gain: As we have mentioned before, the heterogeneous path gain effect 
for different types of MSs leads to significant power efficiency loss and hence the conventional open-loop 
transmission schemes cannot be directly applied in the network MIMO system. With the proposed open-loop 
overlaying scheme, the common MS and the private MS can be simultaneously served. Due to the long- 
term power splitting ratio 9i and 6 p k , we can efficiently control the ICI generated at the common MS side 
under different path gain configurations through the carefully designed long-term power allocation schemes to 
enhance the power efficiency for the private MS. 



4 M C is limited to be less or equal to the number of streams in OSTBC S c . 
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• Exploiting Flexible MIMO Configurations: In the proposed open-loop overlaying scheme, we could accom- 
modate dynamic and heterogenous MIMO configurations in the systems. This can be illustrated through the 
following simple example. Consider three cooperative BSs with heterogeneous MIMO configurations, e.g. BS1 
is equipped with 4 antennas, BS2 is equipped with 6 antennas and BS3 is equipped with 3 antennas. Due to 
mobility of users, assume BS1 and BS2 cooperatively serve one common MS in the first time slot and BS2 
and BS3 cooperatively serve the common MS in the second time slot. In the traditional open-loop overlaying 
scheme, the STBC design has to accommodate BS3 with three transmit antennas for both time slots. However, 
for the proposed open-loop overlaying scheme as illustrated in Fig. |2j BS1 and BS2 can use the remaining 2 
and 4 transmit antennas for the private MSs . In the second time slot, BS2 and BS3 can perform the similar 
operations to serve the private MSs with the remaining transmit antennas. As a result, the proposed open-loop 
overlaying scheme offers flexibility with respect to dynamic and heterogeneous MIMO configurations in the 
systems. 

2) Open-loop Overlaying Detection Scheme: Applying the above transmission scheme, the received signals at 
the common MS can be modeled as: 



K 



^ V PkLmk [H m k,l 
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Similarly, the received signals at the private MS can be modeled as: 
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and S c denote the OSTBC encoded transmitted matrices for the 



Signal Part 

where H mM G C^ x ^; H mfej2 € C N -* N > 

private MS in the coverage of k-th BS and the common MS set with entries ±s p v ±s|'*, . . ., ±s^ RPT , ±s P k * RPT 
and ±s k v ±Sfc* s • • ReT , ±s^ cT respectively. R p k is the encoding rate for the OSTBC and R c is the 

encoding rates for the OSTBC S c . 

At the common MS side, the received signals are radio-frequency(RF)-combined to exploit the macro-diversity. 
Based on (0]), each MS in the common MS set shall detect the whole OSTBC S c by treating the interfering streams 
S?, k = 1, 2, ... ,K as noised and then take the desired stream from the decoded OSTBC S c . At the private MS 
side, the received signal in ([5]) corresponds to a "strong interference" scenario and hence, the MS in the A;-th private 
MS set shall first detect the interfering streams S c and then perform successive interference cancelation (SIC) to 
detect its own information streams Sf. 

Using the OSTBC transmission structure and the above detection schemes, the throughput expressions of the 
common MS and private MSs are summarized in the following lemma. 

Lemma 1 (Achievable Throughput): Using the open-loop overlaying transmission and detection scheme described 
above, the achievable throughput C m of the MSs in the common and private MS sets is given by: 



C 



m G W 

log{l + P k L mk e p k R p k ), 

meUl (k = l,...,K) 



(6) 



Since the interfering streams are contributed by the transmission to the private MSs, the power is much smaller due to the heterogeneous 
path gain (Ljk <C Lik,j 6 U c ,1 € W£) and hence, such detection scheme is reasonable for the weak interference scenarios [T), 1161 . 
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where C) = log (l + i^fp^.^ ) > J G ZY C and Cf = log (l + r^Sfe ) for D is the total number 

of streams of OSTBC S c , D m is the number of streams for the m-th MS (in the common MS set); R p k and R c are 
the encoding rate for the OSTBC and S c respectively. 

Proof: Please refer to Appendix |A] for the proof. ■ 
Remark 4: The approximation in © is quite tight over a wide range of SNR as illustrated by Figure [7] The 
physical meaning of C,-,j G U c is the maximum decodable rate for OSTBC S c at the j-th MS (in the common 
MS set) by treating the streams (S?, k = 1, . . . , K) for the private MSs as noise and Cf , Z G Ui is the maximum 
decodable rate at which the Zth MS (in the k-th private MS set) can successfully decode the OSTBC S c by treating 
the streams (S?) for its own as noise. Hence our detection scheme can always work when the transmission rate is 
given in ©. 

D. Long-term Power Allocation Problem Formulation 

It is very important to adjust the long-term power allocation ratio {9?, 9i} to fully exploit the heterogenous path 
gain and shadowing effect over the network MIMO configuration. In this paper, we consider choosing {9 p k , 9 k } to 
maximize the minimum weighted throughput (with approximation) which is defined as follows. 

Definition 2 (Minimum Weighted Throughput): Define {w m ,m = 1,... , M} to be the positive static weighjH 
which is determined by the Quality-of-Service (QoS) requirement or priority of the m-th MS. The minimum 
weighted throughput of all the MSs (using the approximation in Lemma [B throughput £ is given by: 

C({0£,0£})= min {WmCm} (7) 

mG(Uf =1 «£)U«c 

where C m is giverQ in Q. 

Hence, the optimal long-term power allocation problem can be found by solving the following optimization 
problem. 

({r^n)=arg max 

subject to 9\ + 9% = 1, 

Vke{l,...,K} (8) 

In general, the above optimization problem ([8]) is non-trivial because of the following reasons. The approximate 
throughput expression C m involves complicated operations of the power splitting ratio which is a com- 

position of logarithm and nonlinear SINR expression. Hence, the objective function is in general non-convex and 
the standard low complexity algorithms cannot be directly applied. Moreover, as shown in the current literature, 
this type of problems belongs to the minimum throughput maximization problem for the multi-cell architecture and 
does not exist a trivial global optimal solution in general lfT7l . 

IV. Distributive Long-term Power Allocation Algorithms 

In this section, our target is to propose a distributed long-term power allocation algorithm based on solving 
the optimization given by © in a distributed manner. We formulate the long-term distributive power allocation 
problem using a partial cooperative game. We show that the long-term power allocation game has a unique Nash 
Equilibrium (NE). Furthermore, we propose a low-complexity distributive long-term power allocation algorithm 
which only relies on the local long-term channel statistics and has provable convergence to the NE. 

6 These QoS weights are determined by the application requirement or the priority class of the MS and is determined when the 
communication session is setup. 

7 In fact, C m shall be a function of the power allocation ratio However, we drop them whenever there is no confusion caused 

through the rest of the paper for notation convenience. 
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1) Partial Cooperative Game Formulation: We formulate the long-term power allocation design within the 
framework of game theory as a partial-cooperative game, in which the players are the BSs in the wireless network 
and the payoff functions are the minimum weighted throughput of MSs in the coverage of each BS. The k-th player 
(BS) competes against the others by choosing his power allocation ratio 9 k given other players' power allocation 
ratio to maximize the minimum weighted throughput #f.fc)> which is given by 



7 it> 



o c _ k ) = mm{ti(e c k ,e c _ k )jl(e c k ,e c _ k )} (9) 



where 



flm, B c _ k ) = min { mmiwj^Cj}, mm{ Wj ^ , I e U p }) 

= min f mm{w 3 ^te}, min{™ 7 -f }C?, I £ U? \ (10) 
I jaw D 3 7'e« c D k ) 



g 1 ^") am) 
is the minimum weighted throughput (with approximation) of all the MSs in the common MS set, 

fim,e c _ k ) = w m C m ,mGK P k 

= w m log(l + P k L rnk {l-9 c k )R p k ),meU p (11) 

is the weighted throughput (with approximation) of the MS in the £>th private MS set and 6 c _ k = (6g)^ =l q ^ k is 
the set of long-term power allocation ratios of all the BSs except the £>th one. Hence, the partial-cooperative game 
is formulated as: 

[Wh subject to k e% VKfc ^' W 

where ,J€ = {1, . . . , K} denotes the set of all players, i.e. the BSs, <£ k (9 k ,Q c i k ) defined in © is the payoff functions 
of the player k and & k is the admissible strategy set for player k, defined as ^ = {6 G M : < < 1}. The 
solutions of the above game §f are formally defined as follows. 

Definition 3 (Nash Equilibrium): A (pure) strategy profile of the long-term power allocation 6 C * = (6 k *)ke.^ G 
@ k X ... x @k is a NE of the game 5f if 

£ fe (^r, 0%) > <£ k (6 c k , 0<zl), V9 c k G %, Vk e X. (13) 

At a NE point, each BS k, given the long-term power allocation profile of other BSs O'lt., cannot improve its 
utility (or payoff) by unilaterally changing his own long-term power allocation strategy 6 k . The absence of NE 
simply means that the distributed system is inherently unstable. In order to obtain the distributed solution, we shall 
characterize the properties of NE such as the existence and uniqueness through the following theorem. 

Theorem 1 (Existence and Uniqueness of NE): The strategic partial-cooperative game §f has the following two 
properties: 

1) There exists a NE for the strategic partial-cooperative game . 

2) Moreover, the NE is unique. 

Proof: Please refer to Appendix |B] for the proof. ■ 
From Theorem [TJ we can well establish the properties of the strategic partial-cooperative game which is 
shown to have a unique NE, and hence the distributed system is stable. In the following part, we shall develop the 
corresponding distributive algorithm to achieve the optimal long-term power allocation for each BS in the overlaid 
network architecture. 

2) Algorithm Description: Since the above partial-cooperative game has a unique NE point, we shall try 
to develop the proper algorithm to find the desired point. Traditionally, in a partial-cooperative game, the best- 
response update algorithm fl8l is shown to achieve good performance. However, the convergence property cannot 
be guaranteed due to the reason that there might be some best response cycles |fT9l . In our partial-cooperative 
game, the best response update can be shown to be not converging to the NE in some cases. In what follows, we 
shall propose a novel algorithm with provable convergence property. 



8 For fixed Q v k can be uniquely determined through Q v k — 1 — 
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Algorithm 2 Distributive Long-term Power Allocation Algorithm 

• Step 1: Initialization: 

Set iteration index i = 1. 

For each base station / £ J?f choose some power the splitting ratio r\?* S [0, 1) and rjf'* = 1 — r/f'* such that 
9i{Vi'*) = ff(jfi*\ Under this initial power allocation setup r) c '* = [if{* , rfe* , ■ ■ ■ ,Vl*]> th e common MS 
broadcasts the measured receive SINR r m ; n (l) to all the cooperative BSs. 

Each BS solve the equation w c log(l + r min (l)) = wf log(l + PiLf(l - 9f)Rf) with respect to 9f to get 
the root £f(l). Each BS adjust its own power allocation ratio such that Of (I) = max{^ c (l), rf'*}. Under the 
power allocation vector C (1) = [9f(l), O^il), • • • , 91(1)], the common MS broadcasts the measured receive 
SINR r max (l) to all the BSs. 

• Step 2: Power Allocation Update: 
Update index i := i + 1 

We choose T(i) = (T mm (i — 1) + r max (z — l))/2, each BS solves the equation tu c log(l + T(i)) = 
wf log(l + PiLf(l — 9f)Rf) to get the solution £f(i). Each BS adjusts its own power allocation ratio 
9f(i) = max{£f(i),r]i'*}. 

• Step 3: Signal and Interference Plus Noise Estimation: 

The common MS broadcasts the measured receive SINR T(i) to all the base stations. 

• Step 4: Termination: 

if r(i) < rg), r min (i) = r(i) and r max (i) = r max (i - 1). 
if r(i) > r(i), r max (i) = r(<) and r min (i) = r min (i - 1). 

If T(i) = T(i), terminate; otherwise, go to Step 2. 



Theorem 2: The proposed iterative power allocation algorithm in Algorithm 2 converges to the unique NE defined 
by ([13. 

Proof: Please refer to Appendix [C] for the proof. ■ 
Remark 5 ( Complexity and Signaling Overhead): The computation of the long-term power allocation is dis- 
tributed at each of the K BSs. Furthermore, the iterations are done over a long time scale (instead of short-term 
CSI time scale) where each common MS shall broadcast its long-term SINR, which is a scalar, in each iteration. As 
a result, the signaling overhead is very low. Furthermore, as illustrated in Fig. [3l the algorithm has fast convergence 
and hence, the total iterations required are very limited. 

Fig. [3] shows the convergence property of the proposed power allocation algorithm as specified in the figure 
caption. Through the numerical studies, we found that the proposed long-term power allocation algorithm is shown 
to have a fast convergence. 

V. Numerical Examples 

This section provides some numerical examples to verify the behavior of our proposed long-term power allocation 
strategies in a network MIMO configuration with 10 cells arranged in a hexagonal manner. Each cell has 5km radius 
and there are 50 type-I MSs (regular MS with weight 1) and 50 type-II MSs (higher priority MSs with weight 2) 
in the system. These MSs are moving with high speed in the coverage according to a random-walk model Bull . We 
also assume each BS is equipped with 4 antennas, namely N t = 4, and N£ = Nf = 2. Each MS is equipped with 2 
antennas, namely N r = 2. The path gain model is given by PG(dB) = —130.19 — 37.6 log 10 (d(km)) and shadowing 
standard deviation is 8dB as specified in the IEEE 802.16m evaluation methodology Ell . For illustration purpose, 
we compare our proposed long-term power allocation algorithm with the following baseline schemes. Baseline 1: 
Orthogonal-Division (TDD/FDD) Based Strategy, i.e., the BSs serve the private and common MSs alternatively in 
different time/frequency slots. Baseline 2: Uniform Power Allocation, i.e., the BSs serve the private and common 
MSs simultaneously with uniform power allocation. Baseline 3: Centralized Long-term Power Allocation, i.e., the 
BSs serve the private and common MSs simultaneously using our open-loop SDMA scheme but the long-term 
power allocation is computed by the brute-force centralized numerical evaluations of problem ([8]). 

Fig. H] shows the minimum weighted throughput comparison for different open-loop schemes with respect to the 
BS transmit power. From the numerical examples, we notice that if we apply the open-loop overlaying scheme 
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in a naive manner without careful long-term power allocation, the system performance can even be worse than 
the traditional orthogonal-division based open-loop scheme (baseline 1 over 2). However, if we utilize the open- 
loop overlaying scheme with careful long-term power control, the system performance can be greatly improved 
(baseline 3 and the proposed scheme over baseline 1 and 2). Moreover, the proposed low complexity distributive 
power allocation algorithm can achieve significant performance advantage over the traditional orthogonal-division 
(TDD/FDD) schemes (baseline 1) and has negligible performance loss with respect to the centralized scheme 
(baseline 3). From the simulation results, we observe that the performance of the proposed distributive long-term 
power allocation algorithm is close-to-optimal (baseline 3). In other words, the NE of the partial cooperative game 
in (fT2l is quite "social optimal". 

Fig. [5] and Fig. [6] show the minimum weighted throughput comparison for different open-loop schemes with 
respect to the private and common MS set thresholds and £ c in Algorithm 1 respectively. We observe the 
performance gain of the proposed open-loop overlaying scheme over various baselines at various threshold values. 

Moreover, the numerical result in Fig. [5] and Fig. [6] again show that the NE of our partial cooperative game in 
(fT2l) is almost "social optimal". 



VI. Conclusion 

In this paper, we proposed an open-loop overlaying scheme to adjust the dynamic and heterogeneous MIMO 
configurations in the network MIMO systems. To exploit the heterogeneous path gain effect among multiple cells, 
we propose a distributive low complexity long-term power allocation algorithm with provable convergence property 
which only relies on local channel statistics. Through numerical studies, we show that the proposed open-loop 
overlaying scheme with distributive long-term power allocation algorithm can achieve significant performance 
advantages over the traditional schemes and has negligible performance loss compared with the centralized scheme. 



Appendix A 
Proof of Lemma [T] 

Denote C c to be transmission data rate of the OSTBC S c , also denote Cf,l G U\ to be the maximum decodable 
rate at which the Z-th private MS (I 6 Uf) can successfully decode the steams for the common MS set, i.e. OSTBC 
S c . The expression of Cpi is given by: 

Cf = E[log(l + ->f)] (14) 

where 7? is the instantaneous SINR at the Z-th MS (in the k-th private MS set) when decoding OSTBC S c by 
treating OSTBC as noise and is given by: 



= —r-r-^ — ^tiy 1 G u l (is) 




where Tr(-) denotes the matrix trace operation. Hence SIC can be performed at the /-th MS only when C c < Cf 
and the achievable throughput for the /-th MS after SIC is given by 

C,=E[log(l + Aj)] leU p k (16) 

where A/ denotes the instantaneous SNR (after the interference cancelation) at /-th MS (in the fc-th private MS set) 
and is given by: 



Tr{Z,Zf} 



A/ = * ^.r^Jtn ' 1 G U fc (17) 



9 The near social optimal performance of the NE is due to the partial cooperative component in the game formulation in d 1 2b . 
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Let Cj = E[log(l + € U c be the maximum decodable rate for the OSTBC S c at the j-th MS (in the 

common MS set) with 7? denoting the instantaneous SINR at the j-th MS (in the common MS set). The expression 



of 7? is given by: 



H 



NF 



L H ifci2 S' 



^ < ( Ef =1 



(18) 



If we choose the transmission rate of OSTBC S c to be C c = min JgW c ; gW p fc=1 ^{C^,C ; P }, all the private MSs 
can successfully perform SIC and the result throughput expression for the MSs in the common MS set and private 
MS sets is given by 



r 



HfC c = ^ L ^^jeW,ieU p k ,k=i,....K{Cj,Cf}, m£U c 
E[log(l + A m )], meti£,k = l,...,K 

where C?,j G U c \ Cf , I G Ug, k = 1, . . . , K and E[log(l + A m )] ,m G U£, k = 1, . . . , K are given by: 
C| = E [log (I + 7!)] 



log 1 + 



E 


H 












E 


Tr{ ( Ef=i V^H jM S p + Zi ) ( Ef=i ^^H iM S£ + z/ } 



log 1 + 







E 


T ( PkL jk e c k yj cccc.HttH \ 




E 


Tr( Z ,Zf); 


+ Ef=iE 


\ n jk,l s k s k "-jk,! J 



(19) 

(20) 
(21) 

(22) 



Cf = E[log(l + 7 f)] 



E 



log 1 + 



Tr 



- — t±l k2 z> 



P k U 



Ni 



kV k I I CC 



H 



E 



Tr 



log 1 + 



E 


' Tr |S^£ H{M S c S c - ff H^ 2 }" 




E[Tr(Z,Zf)] +E 







E[log(l + A m )] « log 1 + 



E[Tr(^^H mM S p S^ M ) 
E[Tr(Z m Z£)] 



,meU%,k = l,... ) K 



(23) 
(24) 



leULk = l,...,K (25) 



(26) 



where the approximation is asymptotically tight when the matrix size iVf is sufficiently large. Fig. |7] illustrates the 
quality of the approximation of (1241) for reasonable values of iVf (4x4 MIMO configurations). As shown in the 
figure, the quality of the approximation is quite good even for small Nf. Equations d22l) . (1261) and (1251 ) are derived 
from the independent property between the noise and the transmitted signals. 

Due to the OSTBC transmission structure at the BS side, the above relations can be simplified and we have lfT2l . 

ma 



s c s c,H 



^\Kj 2 I Nr =R p k Tl N[ ,Vk = 1,2,..., K 
i=i 

j^Kfl^ = R c TL Nf 

2=1 



(27) 
(28) 



IEEE TRANSACTION ON WIRELESS COMMUNICATIONS, ACCEPTED 



1 1 



J> l|2 



where we use the assumption that the transmitted symbols are normalized to unity, i.e., \\s k - u 
all k. Hence, the following relations can be directly derived. 



V\\2 



E 
E 



Tr 



H 



mk,2 



mk,2 



k^rak^k 1 



ir I — t^mk^k^k M 



rnk,l 



P k L c mk e p k R p k TN r 



1 for 

(29) 

(30) 
(3D 



E[Tr(Z^Z^)] = N r T 

Substitute the above relations into (l22l . (125T ) and (l26l ) and denote the approximate value of C m ,Cj, Cf and E [log (l + X m )\ 
with C m ,C C j,C P and log (l + PkL m kO k R k ) respectively, we have Lemma [TJ 



Appendix B 
Proof of Theorem Q] 

We shall provide the proofs of the existence and uniqueness in the following two subsections. 



A. Existence 

To prove the existence of the NE in the game we shall first characterize the following two properties of the 
payoff functions in the game & '. 

1) The payoff function £k(9 k ,0^ k ) is continuous in C . 

We first notice that WjCj,j G U c , gf,{9 k ,6 c _ k ) and f k (9 k ,6 c _ k ) are continuous functions with respect to 
6 C . Since the minimum operation preserves the continuity property of the original functions, = 

mm{fl(9 c k ,e c _ k ),fk( d h -k)} is thus continuous in 6> c . 

2) The payoff function is quasi-concave in 9 k . 

Since WjC^j £ U c , gl(0 k , and f k {Q k , 9-^) are the composition of a linear fractional function (which is 
quasi-concave in 9 k ) and a logarithm function (which is non-decreasing), we can conclude that WjC C j,j € U c , 



and /!( 



fe' 



c _ k ) are quasi-concave in 9?. Moreover, since the minimum operation preserves the 



quasi-concavity, we can prove that the payoff function dk{9 k ,d'L k ) is quasi-concave in 9 k . 
In addition, since the admissible strategy set of player k, is a nonempty, convex and compact subset of Euclidean 
space R, we can conclude that at least one NE exists in the game W which is the direct result of |[22l - |[26ll . 



B. Uniqueness 

To prove the uniqueness of NE, we shall provide the following two Lemmas before the main proof. 
Lemma 2 (Monotonicity): Define f 1 {9 k ^6 c _ k ) to be monotonic increasing in C , if 

f 1 (e c k ,e c _ k )>f\9t',e c _ k ) 

for all C y and the equality holds if and only if C = 6 C . It can be shown that <7 1 (0 C ), f l {6 c ) is monotonic 
increasing in 6 C . Also we can observe that for given 0°_ k , g l {9 k ^9 c _ k ), g\{9 k ) and f k {Q%,d < Li s ) is monotonic 
increasing in 9 k . f k (9 k ) is monotonic decreasing in 9 k . 

Lemma 3 (Utility Solution): Given 8 c _ k , the maximum value of the player Z's utility function 

<Efc(0£,0I fc ) =™in{fl{9%,Ot k )Jl(9%)} 

exists and is unique. Moreover, we have f k {9 k * ,6 ( L k ) = at the maximum point 9 k *. 

Proof: The proof of Lemma [2] is straight-forward and hence omitted due to the page limit. The sketched proof 
of Lemma [3] can be summarized as follows. 

For any given 0°_ k , from Lemma|2j we can find that f 1 (9 k , reaches the minimum value at the point 9i = 
and the maximum value at the point 9 k = 1. Similarly, f k {9 k ) reaches the maximum value at the point 9 k = and 
the minimum value at the point 9 k = 1. Since fli^h^-k)/fk(^k) ^ s continuous and strictly increasing/decreasing 

l0 In this paper, >^ means componentwise larger or equal. 
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with respect to 9 c k , combing with the fact that fl(0,Ot k ) < /£(0) and fj^(l,0t k ) > 

we can conclude that 

there exists a unique point 9 k * G (0,1) such that fc(9i* \0^_ k ) = which corresponds to the maximum 

value of mm{f 1 (9 c k ,e c ^ k ),f 2 (9 c k )}. Hence, Lemma [f] follows. ■ 
With the well elaborated Lemma [2] and Lemma [3] we shall prove the uniqueness of NE in our game & = 
[Jzf , {S>k}-, {£&(•)}] through the mathematical contradiction. Suppose there exist two different NEs G and 6 . Without 
loss of generality, we can category the relationship between g 1 (6),g 1 (6 ) into the following three classes: 

1) g 1 {9) < g 1 (6 ) , we can conclude that there must exist j £ if such that 9j < 9-. 

a) Applying Lemma H we have g](9 J ) < g](9' j ) and f?(6j) > fjty). 

b) Applying Lemma [3j we have fj(G) = /?(%) and fj(0') = f]{9'-). From the relation that /?(%) > 
ff(9' j ), we have f}{6) > f}{0'). 

c) Since fj(G) = mm{g 1 (0),g](9 3 )}, f}{0') = min^'), g^)}, and g\6) < g\9'), g](9 j ) < 
g](9' j ), we have f}{6) < f}{0'). 

Thus, a contradiction between step [lb] and step [Tc] has been found. 

2) g 1 {6) > g 1 (0'), the proof shall follow the same lines as in the previous case with 6 and swapped. 

3) 9 l {6) = g 1 ^ )» since 6 and are two different NEs, without loss of generality, we assume the jth component 
of the power allocation ratio vector satisfy 9j < 9-. 

a) Applying Lemma [2] we have ff(0j) > fj(6j)- 

b) Let g l {9) = g 1 (6 ) = A, and the relation between ff{9j) and fj{9j) can be characterized as follows. 

i) ifA<g](9 J )<g](9' j ) 

f}(0) = mm{gH0),g](9j)} \ = A \ 2 _ 2 , 

f}(0') = min{gHe'),g](9' ] )} J ffty = A ) ^ ~ Wi) 

ii) if g^Oj) <A< gfrOj) or g 2 (9 ) <A< g$(0'.) 



f}(0)=min{gi(O),g](9 J )} \ fjiOp g) 
f}(0') = mm{gi(0'),g](9' 3 )} J fjty) = A 



hi) if g](9 j )< g] (9-) < A 



f}(e)=mm{gH0),g](9 ] )} \ fj(9 J )=g](„ ]J , , 
f}(0') = min{gHe'),g](9' ] )} J fj(0>) = g](9' J ) )*hWi)<W S ) 

The first "=>■" is based on Lemma|3]and the second one can be easily verified through basic mathematical 
relations. Combining with the above three cases, we can conclude that ff{0j) < fj(@j)- 
Thus, a contradiction between step [3a] and [3b] has been found. 
In summary, since we can always find a contradiction with the assumption that there exist two different NEs, 
we can draw the conclusion that the NE is unique in the non-cooperative game . ■ 

Appendix C 
Proof of Theorem [2] 

To prove the convergence property of the proposed iterative power allocation algorithm, we shall first establish 
the following relation about the optimal power allocation ratio. 

Lemma 4 (Optimal Power Allocation): For each player k, given the other players' power allocation Q c _ k , the 
maximum value of the payoff function could be determined through 

ft*= max (32) 

where 9% j are the solutions to wfi)(9 c k , Q c _ k ) = fi(9 c k ),j G U c and 0% l , I G U p is the solution to g 2 k (9 c k ) = fl{9 c k ), 
respectively. 

Proof: Applying Lemma[3] the maximum value of the payoff function satisfied the following relation f k {9 k *, 6 c _ k 
f k (9 k *) and the optimal power allocation ft k * is one element in the set {6 k \ j £ U c ,l G U k . Due to the 
mono tonic increasing property of the function fl{9 k ,0^ k ) with respect to 9 k , we have Lemma [4] ■ 
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We now give the rigorous proof of Theorem [2] as follows. Without loss of generality, we assume C * to be the 
unique NE in the non-cooperative game and the value of g l (6 c '*) is equal to be A*. For all k € {1, 2, . . . , K}, a 
direct result of Lemma [4] shows that = max{a£'*, %.'*}, where a c k '* and rji'* are the solutions to the following 
equations. 

= A* (33) 

ft^l) = dlivt) (34) 

Since 7] k * can be locally determined, the remaining is to find the value of A* through numerical algorithms. 
A standard bisection search based argument E71 with provable convergence property can be applied. Combining 
with the unique property of the NE as established in Theorem [TJ the optimal power allocation ratio 6 C * can be 
determined. To improve the speed of the convergence, we shall properly set the initial conditions as follows. 

1) A min = g 1 ^*) <gl(O c >*) =A*. 

2) A max = gl(e c (l)) >g\(9 c >*) = A* 

where C (1) = . . . ,0^(1)] and 9 c k {l) = max{l - € U p k . 
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Common MS set, cooperatively 




served by BS1, BS2, BS3, respectively 



Fig. 1. An example of network configuration containing three cells. MSs in the solidline red circle represent the common MS set which 
is served cooperatively by all the BSs within coverage. MSs in the solidline blue circle represent the private MS sets which are served 
non-cooperatively in the coverage of BS1, BS2, and BS3, respectively. 



X, : stream for MSI 




Cell 2 Cell 1 



Fig. 2. An illustrative diagram of the transmit structure at each BS. For example, the BS1 and BS2 are serving the common set (MSI and 
MS2) cooperatively with the same OSTBC (in which stream xl is for MSI and stream x2 is for MS2) and serving the private MS3 and 
MS4 using two different OSTBC structures respectively. 
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Convergence Property of the Iterative Power Allocation Algorithm 
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Fig. 3. Convergence property of the proposed distributive long-term power allocation algorithm in a cellular network with 3 BSs. Assume 
the private MS sets and the common MS set have already been determined with our user scheduling algorithm. As a result, 3 BSs non- 
cooperatively serve 3 private MSs (MS1,MS2 and MS3) respectively and two (M° — 2) common MS (MS4 and MS5) cooperatively. In the 
simulation, we choose Pi = P 2 = P 3 = 30dBm;Ln = -118.30<iB, L 12 = -UO.lAdB, L 13 = -139.29ciB; L 21 = -145.11ciB, L 22 = 
-115.56ciB,L 2 3 = -143.23dB;L 3 i = -147.78(iB, L 32 = -139.65dB, L 33 = -116.35dS; L« = -135.24ciB, L 42 = 
-136.08dB, L 43 = -135.35ciB;L 5 i = -135.16dB, L 52 = -135.91dS, L 53 = -134.94d_B and the QoS weighting coefficients are 
given by wi = W5 = 2, w 2 — w 3 — W4 = 1. 



3.5 




Transmit Power (dBm) 



Fig. 4. Minimum weighted throughput comparison for different open-loop schemes with respect to the transmit power. Through the numerical 
examples, the minimum weighted throughput of the proposed distributive long-term power allocation scheme with finite iteration numbers 
can outperform the conventional orthogonal-division (TDD/FDD) based open-loop scheme (baseline 1) as well as the open-loop overlaying 
scheme with uniform power allocation (baseline 2) and has negligible performance loss compared to the open-loop overlaying scheme with 
centralized (global optimal) power allocation (baseline 3). In the simulation, we choose = . . . = ^ P K = 20dB and £ c = 5dB. 
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Fig. 5. Minimum weighted throughput comparison for different open-loop schemes with respect to the private MS set threshold in 
Algorithm 1. Through the numerical examples, the minimum weighted throughput of the proposed distributive long-term power allocation 
scheme with finite iteration numbers can outperform the conventional orthogonal-division (TDD/FDD) based open-loop scheme (baseline 1) 
as well as the open-loop overlaying scheme with uniform power allocation (baseline 2) and has negligible performance loss compared to 



the open-loop overlaying scheme with centralized (global optimal) power allocation (baseline 3). In the simulation, we keep £f 
and choose £ c = 5dB. Transmit Power of all the BSs are 35dBm. 
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Fig. 6. Minimum weighted throughput comparison for different open-loop schemes with respect to the common MS set threshold £ c in 
Algorithm 1. Through the numerical examples, the minimum weighted throughput of the proposed distributive long-term power allocation 
scheme with finite iteration numbers can outperform the conventional orthogonal-division (TDD/FDD) based open-loop scheme (baseline 1) as 
well as the open-loop overlaying scheme with uniform power allocation (baseline 2) and has negligible performance loss compared to the open- 
loop overlaying scheme with centralized (global optimal) power allocation (baseline 3). In the simulation, we choose Ci = • • ■ = — 20dB. 
Transmit Power of all the BSs are 35dBm. 
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Evaluation of Expression (23) v.s. Different Power Allocation Ratio 8f 
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Fig. 7. Illustration of the quality of approximation in {23} and j24b . 



